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Abstract 

In the context of algebraic statistics an experimental design is described by a set of polynomials called the design ideal. This, in 
turn, is generated by finite sets of polynomials. Two types of generating sets are mostly used in the literature: Grobner bases and 
indicator functions. We briefly describe them both, how they are used in the analysis and planning of a design and how to switch 
between them. Examples include fractions of full factorial designs and designs for mixture experiments. 
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1 Introduction 



In the algebraic statistics literature two typ e s of po l ynomial repre s entati ons of an experimental design are stud- 
ied: the Grobner ty pe (see Pistone and Wvnn (1996), Pisto ne et al.l ([2001)) and the indicator function type (see 
Fontana et al. ( 20001) . Ye ( 20031) . Pistone and Rogantin d2007bl) ). In this paper we compare them, describe how to 
derive them from the design points and how to use them in the analysis of the design properties by unifying and com- 
pleting results from the literature. Mainly we provide an original and efficient algorithm to switch between the two 
representations. The diagram below summarizes the paper. 



Points coordinates 

I 

Indicator function 



Generating set 

X I 

Grobner representation 



In Section [2] the two representations are described and their relative practical advantages are discussed. Algebraic 
algorithms to move along the four down-arrows of the diagram are discussed. In Section[3]a theorem and an algorithm 
to change representation are given which do not require the knowledge of the coordinates of the points. This is 
represented by the horizontal arrows in the diagram. 

The horizontal arrows are particularly important in the planning stage of the experiment. This is because designs 
with a given confounding structure can be easily defined through generating sets and actual point coordinates are 
unknown until the corresponding system of equations is solved. The actual number of points in the design can be 
computed from the design ideal using the Hilbert function, as we do in Section|4] Analogue theorem and algorithm for 
mixture experiments are presented in Section [4] An implementation of the algorithms in the general-purpose mathe- 
matics software package Maple is provided in the Appendix. In Section a large design for a screening experiment 
from the chemical literature is studied. 

We us e the dedicated sy mbolic softwares GdGoA, see ICoCoATeaml (2005), and the general purpose software 
Maple, see lChar et al.l (1199 II) . The provided algorithms can be easily implemented in other softwares. 
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Example 1. To illustrate the main points of our discussion we use the two simple designs, Ta and Tp below 



T A = {(1,0), (-1,0), (0,1), (0,-1)} 

T P = {(1,0, 0), (0, 1, 0), (0, 0, 1), (1/3, 1/3, 1/3), (1/2, 1/2, 0), (1/2, 0, 1/2), (0, 1/2, 1/2)} 
Note that the components of each point of Tp sum to one. 



□ 



2 The two representations 

We begin w ith some unavoidable a lgeb raic notions. Relevant references to polynomial algebra can be found in 
the textbooks bv lCox et alj dl997ll2005b and lKreuzer and Robbianol(l2000il2005l) . 

Let A: be a computable numerical field, and k m be the affine m— dimensional space. We consider a design with m 
factors, where the levels of each factor are coded with integer, rational, real or complex numbers. In practical situations 
k is the set of the rational numbers Q, For the indicator function representation, we need an extension of Q to include 
the imaginary unit and some irrational real numbers. This is a computable set. Then, a design T is a finite set of n 
distinct points in k m . Let R = k[xi, . . . , x m ] be the polynomial ring in m indeterminates with coefficients in k. The 
indeterminates in R correspond to the design factors. 

Three notions from algebraic geometry/commutative algebra are corner stones. 



• Ideal of a design. The design ideal of T is 

I{T) = {/ G R\f(C) = for all f G T}. 

I(T) is an ideal, i.e. f + g E I(T) for all /, g G I(T) and fg G I(T) for every / G I(T) and g G R. The 
Hilbert Basis theorem states that every polynomial ideal is finitely generated. Thus, there exist fi,...,f r E 
I(T) such that 

r 

f E I(T) if, and ony if, / = Sifi for some Si G R. 

i=l 

The set of generators fi,...,f r is not unique. The ideal generated by /i, . . . , f r is indicated by (/i, . . . , /,.}. 
Conversely, given an ideal /, the set of zeros of / is, by definition, an algebraic variety and corresponds to the 
zero-set of any of its generator sets. 

• Interpolation. For any fc— valued function, F, defined on a design T, there exist (interpolating) polynomials 
/ G R such that /(C) = F{() for all C G T. 

• Quotient space. A standard algebraic construction is the quotient ring R/J for any ideal J C R. The relation 
~ defined as {/ ~ g if, and only if, / — g E J} is an equivalence relation. The elements of R/J are the 
equivalence classes of ~ .R/J inherits a ring structure from R by defining sum and product of classes as 
[/] + [<?] = [/ + <?],[/][<?] = [/<?]• 

If /, 5 € -R interpolate the same function F defined on the design T, that is to say, / and g are aliased on T, 
then / — g is zero if evaluated on each £ G T, and so / — g G I{T). Hence, there exists a unique class in 
R/I(T) that contains all the polynomials interpolating the same function F. 

In algebraic geometry a design T is seen as a zero-dimensional variety. The focus both in algebraic statistics and 
in this paper switches from the design T to its ideal I(T). As we shall see below, the Grobner representation and the 
indicator function representation of T are nothing else than two sets of generators of I(T). Replicated points can be 
considered but some technical issues, which are briefly illustrated in Example[2] occur which are outside the scope of 
this paper. 
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Example 2. In Q[xi, X2] consider the two ideals 1\ and I2 defined as 1\ = (x\, x\\) and I2 = (xi + X2, x^)- The zero 
sets of I\ and I2 are equal and consist of the point (0, 0) with multiplicity two, as can be easily checked by solving 
the two systems of equations X\ = x\ = and x\ + x^ = x\ = 0. But the two ideals are not equal because the 
polynomial x\ is in 1\ but not in I2 and conversely x\ + a; 2 G I2 but not in I\. 

In general, questions like equality of ideals (ii = I2), membership of a polynomial to an ideal (x\ £ 7i and 
x\ I2), intersection and sum of ideals can be handled by using computer algebra softwares. 

□ 



2.1 Indicator function 

To define the indicator function of T we must consider J 7 as a subset of a larger design T> C k ,n . Usually, but not 
necessarily, T> has the structure of a full factorial design. The indicator function F of T C T> is the response function 



The p olynomial indicator function for two leve l fractional factorial d esigns were introduced in iFontana et al. 



( 1997) and Fo ntana et al. (2000) and independently in Tang and DendJl999h with a slightly different presentation. An 



extension to two-level designs with replicate s is in HJ2003) and to multilevel factors, using orthogonal polynomials 



with integer coding, in lCheng and Yd (120041) . 

The case of factorial designs is treated in Pistone and Rogaiitinl ( 2007b). where the rij levels of each factor are 



coded by the nj-th roots of the unity, j = 1, . . . , m. With this coding an orthonormal base of the response space on 
the design is formed by the set of all the monomial terms: 

{x a , aeL} and L = {a — (a±, . . . , a m ) , ctj = 0, . . . , rij — 1 and j = 1, . . . ,771} . 

The indicator function F is a real valued polynomial with complex coefficients: J2 a eL -^ Q (C)' ( £ T>. In this 
case, the coefficients are related to many interesting properties of the fraction in a simple way: orthogonality among 
the factors and interactions, projectivity, aberration and regularity. For instance, the fraction is regular if and only if all 
the coefficients are equal to the ratio between the number of fraction points and the number of the full design points; 
the level of a simple factor of an interaction occurs equally often in the fraction if and only if the coefficient of the 
corresponding term is zero; two simple factors or interactions are orthogonal if and only if the coefficient of the term 
with exponent the sum of the two exponents is zero; a fraction is an orthogonal array of strength s if and only if the 
coefficients of the terms of order lower than s are zero. 

Example 3. The fraction of a 3 4 full factorial design Tn = {(1, 1, 1, 1), (1, u)\, uj\, (1, 0J2, 0J2, ^2), 

(wi, lo 2 ), (wi,wi,w 2 , 1), (wi,o>2! l,wi), (oj 2 , l,W2,wi), (k>2,wi, l*^), (u>2,U2,ui, 1)}, where 1, u>i, u> 2 are the 

cubic roots of the unity, is a regular fraction; in fact, its indicator function is 

ip 1 /i i ,222, 2,22 , 2 i2 2, 2,22\ 

r = — (1 + X2X3X4 + X 2 X 3 X A + XlX2X 3 + X 1 X 2 Xz + X\X 2 XA + X 1 X2X 4 + X\X%X^ + X^X^Xij . 

The fraction points are 1/9 of the 3 4 design; in fact the constant term is 1/9. Moreover, each factor is orthogonal to 
the constant term as shown by the fact that the coefficients of the terms of order 1 are 0. Any two factors are mutually 
orthogonal; in fact the coefficients of the terms of order 2 are 0. The interaction terms appearing in the indicator 
function are the "defining words" of the regular fraction. 

Example 4. The fraction of a 2 5 full factorial design T = {(1, 1, 1, 1, 1), (1, 1, 1, -1, 1), (1, 1, -1, -1, 1), 

(1,-1, -1,1,1), (-1,1, -1,1,1), (-1,-1, 1,1,1), (-1,-1, 1,-1,1), (-1,-1, -1,-1,1), (1,1, -1,-1,-1), 
(1,-1, 1,1,-1), (1,-1, 1,-1,-1), (1,-1, -1,1,-1), (-1,1, 1,1,-1), (-1,1, 1,-1,-1), (-1,1, -1,1,-1), 
(—1,-1,-1,— 1,-1)} is an orthogonal array of strength 2; in fact, its indicator function 

11 1 1 1 

F = - - -X1X2X4 + -xix 2 x 5 + -XiX 2 x 3 Xi + -xix 2 x 3 x 5 
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contains only terms of order greater than 2, together with the constant term. □ 

When the coordinates of the points in T and T> are known, the indicator function F can be computed using some 
form of interpolation formula for Equation ( 12.1b . The CbCbA function IdealAndSeparatorsOf Points is used 
in Example [6] If the complex coding is used, the coefficients of the indicator function of a fraction of a full factorial 
design can be easily computed from the sum of the values of each monomial response on all the fraction points: 

Example 5. We consider the fraction T ' = {(— 1,— 1,1),(— 1,1,— l)}of the 2 3 full factorial design. All the monomial 
responses on T are 



1 


X\ 


X2 


£3 


XiX 2 


XiX 3 


X2X3 


X1X2X2 


1 


-1 


-1 


1 


1 


-1 


-1 


1 


1 


-1 


1 


-1 


-1 


1 


-1 


1 



and the coefficients b a are: 

2 2 

0(o,i,o) = 6(0,0,1) = 6(1,1,0) = 6(i,o,i) = 6(0,0,0) = 6(1,1,1) = 4 °(i,o,o) = 6(0,1,1) = —7 • 

Hence, the indicator function is F = \ (1 — x\ — £2^3 + X1X2X3). 

Example 6 (Continuation of Example[T|i. The indicator function of Ta as subset of a 3 2 factorial design is 

F = —2 X1X2 + x\ + x\. 

A C0C0A algorithm for the computation is provided in Item 1 of the Appendix. If the levels are coded with the 3-rd 
roots of the unity, the indicator function is 

Z7> ft 11 2 2 2 2 2 2\ 

V = — (1 + X\ + X2 — X l — X 2 — X\X2 — X\X 2 — X X X2 ~ X 1 X 2 )- 

9 

We can check that there are no mutually orthogonal terms. The indicator function of Tp will be computed in Example 
U3 



2.2 Grobner bases 

When working with polynomial ideals, it is useful to choose a standard form for writing the polynomials. This can 
be done by choosing a term ordering. That is an order relation on the monomials of R, compatible with the product of 
monomials. In more details, a monomial is written as x a = x" 1 . . . with a = (ai, . . . , a m ) and aj £ Z>o for 
alH = 1, . . . , m. The ordering relation >- is a term ordering if 1) a;" y 1 for all exponents a and 2) if x a >- x^ 3 then 
x a+ ~< >- x /3+7 for all a, f3, 7 G Z> . The leading term of / G R with respect to >- is the largest term of / with respect 
to )~ and we write LT^(/), or LT(/) if no confusion arises. 

Example 7. The lexicographic term ordering is defined as x" 1 ■ ■ ■ x"™ >~ x^ 1 • ■ ■ x^ if a\ — 0i, . . . , aj_i = 
jSj-l, ot-i > Pi, for some i G {1,2,..., m}. In the tdeg ordering x" 1 • • • x^ % >~ x^ 1 ■ ■ ■ x^ if, and only if, ^2 a i > 
J20i° T J2 a i = 2 A an d tne right-most nonzero entry of (a\ —01,..., a m — (3 m ) is negative. □ 

Given a term ordering >- and an ideal I C R, let LT^ (I) = (LTy (/) : / G i) be the set of leading terms of all 
polynomials in /. 

Definition 1. Let / be an ideal, >- a term ordering and G = {gi, ■ ■ ■ ,gt} Q I- 

1. G is a Grobner basis (sometimes called a standard basis) of / if LT^ (J) is generated by (LTV (3) : 5 £ G) ■ 

2. G is a reduced Grobner basis if for all g G G the coefficient of the leading term of g is 1 and no term of g lies 
in<UIV(G\{ 5 })}. 
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Note that a Grobner basis of an ideal / is a particular generator set of I. For every i deal / and term ordering >~ 



there exist Grobner bases of / and a unique reduced Grobner basis, see (ICox et all 119971 Ch.2). Grobner bases of / 



can be computed from any generator set of / with the Buchberger algorithm which is implemented in most softwares 
for algebraic computation. For every ideal there is a finite number of reduced Grobner bases as the termorderin varies, 



see 



Mora and Robbiano ( 1988). 



Example 8 (Continuation of ExampleHJ. For any term ordering for which x\ >~ x 2 the reduced Grobner basis repre- 



sentation of I{Ta) is given by the three polynomials g\ = x{ + x\ — 1, 52 



— „-3 



X2 and 33 = x\X2- The polynomial 



gi indicates that the points of Ta are on the unit circle, g 2 that the factor corresponding to X2 has three levels 0, ±1 
and 93 that at least one coordinate of each point in J 7 a is zero. 

The reduced Grobner basis of I{Tp) for any term ordering such that x\ >- X2 >~ X3 has five elements 

hi = X\ + x% + X3 — 1 

h 2 = x 3 (x 3 - l/2)(x 2 + Z3/2 - 1/2) 

h 3 = x 3 (xj + x\/2 - x 2 /2 - 3/4.T3 + 1/4) 

hi = O2 - x 3 )(x% + x 2 x 3 + xl- 3/2x 3 - 3/2x 3 + 1/2) 

h 5 = x 3 {x 3 - l/3)(x 3 ~ l/2)(a!8 - 1) 

In hi we can recognize the sum to one condition for a mixture design and in /15 the levels of the X3 factor. □ 

If we fix a Grobner basis of an ideal / C R, then for every equivalence class [/] G R/I there exists a unique 
/' £ [/] written as combination of monomials n ot divisible by any monomial in LT(J). The polynomial /' is called 
the normal form of / and we write NF(f), see ICox et al. (1997). Hence, Grobner bases give a tool to effectively 
perform sum and products in the quotient ring R/I. 

Given a design J 7 , the quotient ring R/I (J 7 ) is a vector space of dimension equal to the cardinality of T. A 
monomial basis of R/I (J 7 ) can be used as support for a statistical (saturated) regression model as the corresponding 
information matrix is invertible. A vector space basis of R/I^J 7 ) can be determined by using Grobner bases, and 
the procedure, which we call Gbasis/LT, is the following. The monomials which are not in LT(/(^ r )) are linearly 
independent over the design. Call this set Estjr. They are those monomials which are not divided by any of LT(g) for 
all g in a Grobner ba sis of I(T). This is equivalent to the fact that the columns of the matrix X — [C"]^g^- a eL nie 
linearly independent (Pisto ne et al. d200lb ). where L is the set of the exponents of the elements of Estjr. 

Example 9 (Continuation of Example [TJ. In the setting and notation of Example [8] the leading terms of the Grobner 
basis elements of I {J 7 a) are LT(gi) = x\, LT(g2) = x\ and LT(g3) = XiX%. The four monomials 1, xi,x 2 , x\ are 
not divisible by these leading terms, equivalently the first four columns of X below give an invertible matrix. 





' 1 


Xi 


X2 


x\ 


x\ 


C 




1 


1 








1 


(1-0) 


X = 


1 





1 


1 





(0,1) 




1 


-1 








1 


(-1,0) 




1 





-1 


1 





(0,-1) 



The linear response model build on any combination of the first four columns of X is identified. The last column lists 
the design points. From g\ = x\ + x\ — 1 we deduce that x\ = 1 — x\, that is the fifth column of X is the difference 
between the first column and the fourth column. 

For Tp, we have Estjc p = {1, X3, x|, x\, X2, X2X3, x\\- Note that there is no term involving xi as gi = xi + 
x 2 + x 3 — 1 confounds Xi with X2 and X3 (see Section |2~3l >. □ 

The design ideal embeds all possible aliasing relations imposed on polynomial responses by a design. A Grobner 
basis is a special finite set o f aliasing relations am ong polynomial responses defined on the fraction and are a basis of 
all other alias relations, see Holliday et al. (1999). Other special finite sets can be found using the indicator function. 
Theorem 4 and Example 10 in lPistone et al.1 (120071) present an algorithm based on the computation of the normal form, 
with respect to the full design, of all the monomial responses multiplied by the indicator function of the fraction. 
Knowledge of the problem to be modelled indicates whether the sets of aliasing relations from the indicator function 
or from the Grobner bases are more informative. 
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2.3 Designs for experiments with mixtures 



We need to refer here a short summary of iMaruri-Aguilar et al. ( 2007 ). Each point ( = (d, . . . , £ m ) 6 k m 
of a design T for a mixture experiment satisfies the conditions that d > for all i = 1, . . . , m and J^i d = 1- 
The polynomial J^i Xi — 1 € and thus not all the linear terms can be in the support of a regression model 

simultaneously. In particular the Gbasis/LT procedure applied to a design for a mixture experiment returns slack 
models which include the identity/intercept and miss completely one factor. For a mixture design T C k rn there 
exists, well defined, a unique cone passing through T and the origin: 

Cjr = {a( : C e T and a E M} C k m . 



This can be thought of as a projective variety. The Gbasis/LT procedure is specialized to mixture designs exploiting 
the fact that projective varieties and homogeneous polynomials are naturally associated. Consider a term order, the 
cone Cjr and all homogeneous polynomials of degree s in R. Compute a Grobner basis of Cjf, its leading terms and 
the set of mon omials of degree s no t divisible by the leading terms. Then, the information matrix for this set and T is 
invertible (see Mar uri-Aguilar et al. ( 2007b V 

Example 10. Consider the design T = {(0, 0, 1), (0, 1, 0), (1, 0, 0), (1/3, 1/3, 1/3)} C T P , and any term ordering 
such that x\ >~ X2 >■ x%. Then I{Cf) = ( X1X3 — x2X3, x\X2 — X2X3,x\x3 — X2X 2 ). The leading terms are 
underlined. In Table 2.1 va rious homogeneous models id entified by T are given. Notice that they are Rronecker 
models generalizing those in lDraper and Pukelshe im ( 1998). 



Table 2.1: Homogeneous models identified by T = {(0, 0, 1), (0, 1, 0), (1, 0, 0), (1/3, 1/3, 1/3)} 



list of monomials of degree s 



degree s standard monomials 




1 

2 
3 

s > 3 



1 

XI, X 2 , X3 

X 2 , X1X2, x\, X1X3, £2X3, x\ 

32 232 2 223 

•^1*^2; X\X*2, 3*2 > ^1*^3 ; ^1^2^3 1 ^2*^3' ^1^3? *^2^'3i ^3 
„s-l_ ™s-2„ 



*^2? ^1 



; x 3 



1 

Xl, X 2l X3 

2 2 2 
X-y, X 2 , X2X3, X3 

3 3 2 3 
X 1; X 2 , X2X3, X^ 

xf, X2, X2X3 , 3 



□ 



X\X3 



X2X\,X\X3 



XxX%,x\x2 



X1X2} is the reduced Grobner 



Example 1 1 (Continuation of Example[T|i. The set { 
basis of I(Cf p ) with respect to the tdeg ordering with x\ >- x 2 >- X3. For s = 3, Estjc-^, = {x 3 , xy 2 , y 3 , 
xyz, xz 2 , yz 2 , z 3 } gives the support for a homogeneous saturated regression model identified by C^ p . 

We need to observe now that ratios of homogeneous polynomials of the same degree are functions well defined 
on the affine cone of a mixture design. 

Example 12. Consider the three points Pi — (1, 0, 0), P2 — (0, 1, 0), P3 — (0, 0, 1) and the lines Li through Pi and 
the origin (0,0,0). The cone over the points Pi , P2 , P3 is equal to Li U L2 U L3 . Let F be the function which assumes 
the value i on Li, i = 1,2,3. First, we show that F cannot be represented as a polynomial. In fact, if / is a polynomial 
such that f(x, 0, 0) = 1 for each x ^ 0, then / = 1 + f\{y, z). But, /(0, y, 0) = 1 + fi(y, 0) = 2 for every y ^ 0, and 
so / = 2 + 72(2). Hence, f(x, 0, 0) = 2, and so F cannot be represented by a polynomial. Next, note that x ^+y+ 3 z z 
represents F on the considered cone. □ 

The above leads to the following definition, which specializes the id eal of indicator function to mixture designs. 
The larger design V could be any mixture design, i.g. a simple lattice, see IScheffa (1 1 9581) or a simple centroid design, 
see Scheffl (ll963h . Example [12] shows that we need to consider ratios of polynomials of the same degree to define a 
function on Ct> and not simply a polynomial, see Definition 2.B2 and 2.B 1 below. Furthermore, the notion of separator 
as introduced is for consistency with algebraic standard, and takes zero value in V \ T. 
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Definition 2. Let T C P c A:™ be designs for a mixture experiment. 
Al) A separator of £ S J 7 is any homogeneous polynomial 5^ such that /(Cr^i) and e I(Cf\iq). 

S 

A2) The separator function of £ S J 7 is S^i. = m - — -g- where is the degree of S{ . 

Bl) A separator of J 7 C P is any homogeneous polynomial 5^- such that Sf $ I{Cp) and Sj? E I(Ct>\f)- 

B2) The separator function of T C P is 5Fjr = m -^r = where sjr is the degree of S?. 

\L^i=\ X i) Q e jr 

Example 13. The cone generated by Tpp — {(0, 0, 1), (0, 1, 0), (1, 0, 0), (1/3, 1/3, 1/3)} C Tp is the same as the 
cone generated by T = {(0, 0, 1), (0, 1, 0), (1, 0, 0), (1, 1, 1)} C P where P \ T = {(1, 1, 0), (1, 0, 1), (0, 1, 1)}. We 
have 



x\ - 2x\x\ + 7Z2xiX 2 x\ - 2x\x\ + x% - 2x 2 x\ + x% 



(xi +x 2 + x 3 ) e 

indeed S^(C) = 1 if C e T or if ( E T PF and S>(C) = if ( E V \ T or C G \ ^>p- In ExamplelH we 
shall compute a lower degree separator for the same fraction. This shows that there exist different ways of writing in 
polynomial form the separators for the same fraction. □ 

We are now ready to name the two polynomial representations of a design. 

Definition 3. Let >- be a term ordering on k, T C P c k m two designs. 

1 . The Grobner representation of T with respect to y is the reduced Grobner basis of I (J 7 ) with respect to y. 

2. Let {d\, . . . , dp} C R be the reduced ^-Grobner basis of 1(D) and F the indicator function of T in P. The 
indicator representation of C P with respect to >- is {d\, . . . , d p , F — 1}. 

Suppose now that J 7 and P are designs for mixture experiments. 

1. The homogeneous Grobner representation of T with respect to >- is the reduced Grobner basis of I{Cp) with 
respect to y . 

2. Let {di, . . . , dp} C i? be the reduced ^-Grobner basis of I(Cx>) and Sx>\p the separator of P \ J 7 in P. The 
homogeneous indicator representation of J C P with respect to y is {d\ , . . . , d p , Spv^}. 

As a mixture design is in particular a design, it admits both the homogenous representation and the non-homogeneous 
representation. Of course, when using the non-homogeneous one we loose the advantages introduced with the design 
cone. 

While in the non mixture case {d\, . . . , d p , F — 1} is a generating set of in the mixture case the ideal 

I(Cyr) is the saturation of the ideal (d±, . . . , d p , S-p\f). The saturation I sat of a homogeneous ideal I C R contains 
all the homogeneous polynomials / such that fx r [ li E I for some m, E Z>o and every i = 1, . . . , to. In fact, the 
given generators are homogeneous and so they span only homogeneous polynomials of degree not smaller than the 
degrees of the generators, while in the saturation we obtain also polynomials of degree smaller than the degree of 
the generators. For example, the ideals (x) and (x 2 ,xy) in R = k[x, y] have the cone o ver P = (0, 1) as zero set- 
Furthe rmore, I{Cp) = (x) = (x 2 ,xy) sat . For more on saturation see ICox et al.l d2005l) and iKreuzer and Robbiano 
(l2000h . 

We could substitute the requirement of reduced Grobner bases with that of generating sets. Uniqueness of repre- 
sentations will be lost, while there will be no longer dependence on a term-ordering. 
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3 Changing representation 

Let F be the indicator function of J- in V C k m , I(T>) = (d\ , . . . , d p ) and I(J-) = (d\, . . . , d p , gi , . . . , g q ). Note 
that usually the generator set {di, . . . , d p } is known and has an easy structure, often being T> a full factorial design 
and hence dj a polynomial in xj for j = 1, . . . , m, or a simplex lattice design in the mixture case. The difficulty and 
interest are related to T. Then, 

1. I(J-) = (d\, dp, F— 1). This means that once F is known, a Grobner basis of I (J 7 ) is obtained by applying 
the Buchberger algorithm to {d\, . . . , d p , F — 1}. 

2. Vice versa, the lexicographic Grobner basis (see Example|7]i for h >- f >- x of the ideal 

(di, . . . , dp, (1 - /) - ^2 hjgj, fg x , . . . , fg q ) 

j 

contains a unique polynomial of the form / — p(x) where p is a polynomial in the x indeterminates only. Then 
the eval uation function F : V — > {0, 1} defined as F(d) = p(d) for d 6 V, is the indicator function of T in 



V. See dPistone et all 12007. Ch. 6, Th. 2 and 3) 



Items 1. and 2. above provide algorithms to switch from indicator function representation to Grobner basis 
representation and vice versa. While the passage from the indicator function to the Grobner representation is relatively 
easy as it consists of the union of polynomials, equivalently a sum of ideals, the computation of the lexicographic 
Grobner basis required for the passage to the indicator function representation can be computationally expensive and 
often the computation does not terminate. In Section|4]we describe a faster algorithm for this. 

Items 1 . and 2. are easily adapted to the mixture/homogeneous case by considering the cone ideal and the (rational) 
separator function, i.e. ideals generated by homogenous polynomials, for example, F — 1 has to be substituted with 
Sjr — (^2 Xi) s where s — deg(Sf)- Moreover, each time we define an ideal, we must saturate it, to compute generators 
of small degree. See Item 2 of the Appendix for a GdGoA algorithm. 



4 An efficient algorithm 

In this section, we present a different and more efficient algorithm to switch from the Grobner representation to 
the indicator one for a fraction T of a design V. The algorithm is based on the following remark: a polynomial / 
interpolating the indicator function F of T belongs to I(T> \ T) because of the definition of design ideal. Moreover, 
1 — / belongs to I(!F) for the same argument. If G = {gi, . . . , g q } is a Grobner basis of I(J-) then the second remark 
says that 1 — / = Y^i=i ^iflii f° r some h\, ■ ■ ■ ,h q S R. Hence, / = 1 — Yji=i ^i9i nas normal form in Rj 1(V\T). 

The problem now is to efficiently choose hi, . . . , h q . If T> has cardinality N and T has cardinality n, then we 
want h%, . . . ,h g to depend by N — n parameters because / = in R/I(V \ J 7 ) and the dimension of R/I(T> \ J 7 ) 
as vector space is N — n, written as dim R/ I(T> \ T) — N — n. Moreover, we would like to compute hi, . .. ,h q 
with linear algebra techniques, because they usually have smaller computational complexity than Grobner bases based 
algorithms. 

Lemma[T]and Theorem[T]describe how to chose efficiently hi, . . . , h q . 

Lemma 1. Let Estp (resp. Estjr) be a monomial basis of R/I(T>) (resp. R/I^J 7 )) computed by using the previously 
described procedure Gbasis/LT. Then Estjr c Estp. 

Proof. T is a fraction of V, i.e. J 7 C T> and I{V) C /(^ r ). We prove the equivalent statement: if x a £ Estp then 
x a £ Estjfr. Let x a ^ Estu then there exists a polynomial in 1(D) whose leading term is x a . The inclusion between 
the ideals shows that x a ^ Estjr, and the statement holds. □ 

Let x ai , . . . , x aN ~™ be the monomials in Est£> \Est?r. For each j = 1, . . . ,N — n there exists g^tj) in the Grobner 
basis G of such that x aj = nij LT^yj) for some monomial rrij, because of the construction of the monomial 
basis of a quotient ring. 
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Theorem 1. With the notation as above, the classes of mig^i) , . . . , rnM- n gi(N-n) are a basis of R/I(V \ T). 

PROOF. It is sufficient to prove that they are linearly independent. Let a%, ... , etjv-n G k be elements of the ground 
field such that 

a l m l9i{\) + ' ■ ' + ajV-n«1jV-nffi(Ar-n) 

is the zero class in R/I(T> \ J 7 ), that is to say, it belongs to I(V \ T) because of the definition of the quotient ring. 
Hence, we have that aiiiiij^) + • • • + aN^n'mN-ngi(N-n) G T(T>) because it vanishes also on the points in T being 
a combination of elements in a Grobner basis of I (J 7 ). By construction, the leading terms of rajg^ are all different 
and so the leading term of aimig^ + ■ ■ ■ + cin -nrriN ~ngi(N -n) > sav m i LT^m , is in Estp. This is not possible, 
and so a\ = 0. By iterating the argument a finite number of times, we obtain that a% = ■ ■ ■ = ajv_„ = and the claim 
follows. □ 

We know that / = 1 — Yli=i hidi an d so we nave tnat tne indicator polynomial / can have the form / = 
1 — aimig^!) + • • • + aM-nTnN-ngi(N-n) G I(J) \ J-). Hence, if we compute the normal form NF(f) of / in the 
ring R/I(T> \ J 7 ), it must be 0. In general, NF(f) is a polynomial with monomials in Est£>\j*r and linear combinations 
of a%, . . . , ajv-n a s coefficients. Therefore, we obtain a linear system in the unknowns a%, . . . , ajv_ n which has a 
unique solution by TheoremQ] The resulting algorithm is implemented in Maple in Item 3(a) of the Appendix. 

A few modifications are needed to adapt the algorithm to mixture designs. First, we consider homogeneous 
polynomials of a fixed degree. To speed up computations, we work with polynomials of degree s where 



The integer s can be easily computed by using the Hilbert function of R/I(T>) that calculates the dimension as vector 
space of the degree t polynomials in R/I(T>) for every t G Z>o. Second, we compute monomial bases Est© s and 
Est?r jS of the degree s pieces of the quotient rings (R/I(D)) S and (i?//(JT)) Sj respectively. Third, the indicator 
function is now a ratio F = f /{x± + ■ ■ ■ + x m ) s where deg(/) = s with the constraints / £ I(T> \ T) and 
(x\ + • ■ • + x m ) s — f G Hence, the changes are straightforward and the result follows also in this case. See 

Item 3(b) of the Appendix for a Maple algorithm . 

Example 14 (Continuation of Example [TTJ. The set {x\ } C Estjp-p is identified by the fraction T = 

{(1, 0, 0), (0, 1, 0), (0, 0, 1), (1, 1, 1)} C Tp. The ideal of the fraction is (£3(2:1 — X2), {x\ - xs)x2, X2Xz(x2 — X3)}, 
while the ideal of Tp \ T is {x-i{x\ + X2 — X3),X2(xi — x 2 + x s ) 7 xl — x\ + 2x 2 x- 3 — x|, X2X3(x 2 — x 3 )). Then, by 
applying the previous algorithm, we obtain the following indicator function 



The following Maple script performs the computation using the algorithm in Item 3(b) of the Appendix. 

var : = [x,y,z] -- we change x_l f x_2 , x_3 to x, y, z, respectively 

EstX:= {x"3, x y " 2 , y " 3 , x*y * z , x* z " 2 , y * z " 2 , z ~ 3 } — monomial basis of (R/I(F_P))_3 
EstY:- { x~ 3 , y " 3 , y *z * 2 , z * 3 } monomial basis of (R/I(F))_3 

GY:= { z* (x-y) , (x-z) *y, y*z* (y-z) } 

GXMinusY : = z*(x+y-z), y+(x-y+z), x~ 2-y " 2+2 +y +z-z " 2 , y*z*(y-z) 
G_to_F_homo (GY, GXMinusY, EstX, EstY, var, t ) 

x~3 + 3x~2y + 3zx"2 - 5xy~2 + 30xyz - 5xz"2+ y"3 + llzy~2 - 13yz~2 + z"3 




SFr 



x'f + 3x^X2 + 3x^X3 — 5x\X2 + 3OX1X2X3 — 5xix§ + x\ + Wx\ 



|x 3 - \3x 2 x\ +xl 



(Xl + X 2 + X 3 ) 3 



□ 
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5 Example 



The simplex centroid design is defined in IScheffa d 1963b and used for experiments with mixtur es. In k factors 
it has 2 fc — 1 points. Fractions of the simple centroid design with many less points are defined in iMcConkev et al 



(2000) and used to screen for significant factors. Their definition depends on an integer parameter p and the double 
interactions are completely aliased over any such fraction in sets of size p. A typical example is Tmc below 

T MC = {(1, 0, 0, 0, 0, 0, 0, 0, 0), (0, 1, 0, 0, 0, 0, 0, 0, 0), (0, 0, 1, 0, 0, 0, 0, 0, 0), 
(0, 0, 0, 1, 0, 0, 0, 0, 0), (0, 0, 0, 0, 1, 0, 0, 0, 0), (0, 0, 0, 0, 0, 1, 0, 0, 0), 
(0, 0, 0, 0, 0, 0, 1, 0, 0), (0, 0, 0, 0, 0, 0, 0, 1, 0), (0, 0, 0, 0, 0, 0, 0, 0, 1), 
(1/3, 1/3, 1/3, 0, 0, 0, 0, 0, 0), (1/3, 0, 0, 1/3, 0, 0, 0, 1/3, 0), (0, 1/3, 0, 0, 1/3, 0, 0, 0, 1/3), 
(0, 0, 1/3, 0, 0, 1/3, 1/3, 0, 0), (0, 0, 0, 1/3, 1/3, 1/3, 0, 0, 0), (0, 1/3, 0, 1/3, 0, 0, 1/3, 0, 0), 
(0, 0,1/3, 0,1/3, 0, 0, 1/3, 0), (1/3, 0, 0, 0, 0, 1/3, 0, 0, 1/3), (0, 0, 0, 0, 0, 0, 1/3, 1/3, 1/3), 
(1/3, 0, 0, 0, 1/3, 0, 1/3, 0, 0), (0, 1/3, 0, 0, 0, 1/3, 0, 1/3, 0), (0, 0, 1/3, 1/3, 0, 0, 0, 0, 1/3)} . 

The design Tmc can b e seen as a subset of the simplex centroid design in 9 factors T>\ and also as a fraction of 
the simplex centroid which includes the corner points of the simplex and the points with coordinates equal to zero or 
1/3, call it V 2 . 

IdealOf Points(^Mc) with respect to the default term ordering in C bCbA is generated by 43 po lynomials, 
while IdealOfPro jectivePoints(^/c) is generated by 42 ones. See iMaruri-Aguilar et alj (12007) for a dis- 
cussion of these results. 

The indicator functions of Tmc in 2?i an d of Tmc in f 2, and the separator of Tmc in £>2 have been computed 
by using the Maple algorithm in the Appendix in less than 10 sec. the first one, and in less than 1 sec. the last two. 
The indicator functions of Tmc in T>\ is a combination of 444 terms, in T>2 is a combination of 70 terms, and the 
separator of Tmc in 2?2 is a combination of 165 terms. 



6 Discussion 

This note regards two polynomial representations of an experimental design T one of which uses the indicator 
function of T in T> and the other one uses Grobner bases which does not require to think of T as a fraction of the larger 
design V. The Grobner representation depends on a technical object: a term ordering, while the indicator function is 
most informative with a complex codin g of the factor levels. In a pplied work term orderings h ave been used to the 
advantage of the statistical analysis, see IHollidav et al. dl999h . In Pistone and Rogantin (2007a) it is shown that the 



real part of the complex response retains most of the properties of the full complex response while having a clearer 
physical interpretation. Moreover, notice that most of the properties discussed in Section 2.1 depend intrinsically on 
the level coding. A trivial example is that the 2 2 full factorial design with levels ±1 is orthogonal for {1, %\,X2, X1X2} 
while with levels {0, 1} it is not. 

Both representations can be used to identify alias relations imposed by T on Est©, which is a finite set, or on some 
other sets of monomials, possible all the infinite set of monomials. Furthermore, both provide a vector-space basis of 
the response space. This is hierarchical for the Grobner basis representation. 

The choice as to which representation to use should be made in the light of the interests of the practitioner. If the 
responses have been collected and standard or slower techniques have not returned a satisfactory statistical analysis 
or are not implementable (maybe because there are missing values with respect to the planned experiment), then it 
seems convenient to apply the GBasis/LT procedure. This returns an identifiable hierarchical regression model and 
the alias relations in the Grobner basis can be used to change model terms with more significant or interpretable 
model interactions. Instead, prior to data collection, the indicator function seems a useful tool to select a design with 
relevant properties by working on the coefficients of the indicator functions. Two issues have to be considered: 1. the 
need of a complex coding for some properties and 2. the need to solve a system of polynomial equations to obtain 
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the point coordinates. Point 1 has been discussed above. In most practical cases Point 2 can be easily addressed by 
computing a Grobner basis with respect to a lexicographic ordering. Generally the joint use of the two representation 
seems advisable, also in the light of the switching algorithms in Sections 3 and 4. Indeed, if one representation is 
known then the other one can be computed using techniques of linear algebra. This is possible because designs are 
zero dimensional varieties. The complexity of the algorithms in the Appendix is essentially the complexity of the 
computation of the normal form of a polynomial w.r.t. a Grobner basis. In fact the last step of the algorithm consists in 
solving a linear system which has a smaller computational complexity. For the large designs of Section|5]the algorithm 
gave the solutions in just a few seconds, as previously mentioned. 

Finally, we wanted to have in the public domain a complete set of computer functions to perform the computations 
in the diagram of the Introduction. 

Appendix 

1 . The GaGoA code for the indicator function of Ta m Example [6] 

Use S: :=Q[x[l. .2] ] ; 
Define InFu (Points , D ) ; ND:=Len(D); PA : =NewList (ND , 0) ; P :=NewList (ND) ; 
For H:=l To Len (Points) Do 

For K:=l To Len (PA) Do If Points [H] =D [K] Then PA[K]:=1 End; End; 
End; 

IdD : =IdealAndSeparatorsOf Points (D) ; 

For K:=l To Len (PA) Do P [K] : =PA [K] *IdD . Separators [K] End; 
F:=Sum(P); Return F; 
End; 

D:=Tuples ( [-1, 0, 1] , Numlndets ( ) ) ; PointsF:= [ [ 1 , ] , [ -1 , ] , [ , 1 ] , [ , -1 ] ] ; 
InFu (Points, D) ; 



2. The GaGoA code for of ExampleQj] with the algorithm described in Item 2 of Section[3] 

Use T::=Q[f h x[1..3]], Lex; 
Set Indentation; 

D:=[x[2] ~2x[3] - x[2]x[3]~2, x[l]"2x[3] - x[l]x[3]"2, x[l]"2x[2] - x[l]x[2]"2]; 
— simplex lattice 

G: = [x[l]x[3] -x[2]x[3], -x[l]x[2] +x[2]x[3]]; -- ( , , 1 ) , ( , 1 , ) , ( 1 , , ) , ( 1 , 1 , 1 ) 
P:=(x[l]+x[2]+x[3] ) "3; S: = []; 
For I:=l To Len (G) Do 

L:=Concat Lists ( [D, [P-f-hG [I] , fG [I] ] ] ) ; 

Id: =Saturat ion (Ideal (L) , Ideal (x [1] , x [2] , x [3] ) ) ; 

GB:=ReducedGBasis (Id) ; S:=Concat( [f-GB[l]], S) ; 

EndFor; 

SF:=NF (Product (S) , Ideal (D) ) ; SF; 

3. The Maple code of the procedure described in Section [4] to compute the indicator functions ofYcX where 
Y and X are sets of points. The affine and the projective cases are considered. Notice that all computations are 
with respect to the tdeg term ordering of var. This can be changed by the user. 

(a) Affine case. In input the procedure requires: 

GY = Grobner basis of /(Y), 

GXMinusY = Grobner basis of I(X \ Y), 

EstX = standard basis of R/I(X), 

EstY = standard basis of R/I(Y), 

var = list of indeterminates 

The Output is the polynomial representation of the indicator function in R/Ix- 
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G_to_F := proc(GY, GXMinusY, EstX, EstY, var) 

local E, InY, ly, L, flag, tt, F, i, m, 1, v, CC, VV; 

with (Groebner) ; 

v : = op (var) ; 

E := 'minus ' (EstX, EstY) ; 

InY := [seq (leadterm (GY [i] ,tdeg(v) ) ,i = 1 . . nops (GY) ) ] ; 
ly : = nops (InY) ; 
L := [] ; 

for m in E while true do 
flag :=0; 

while flag = do for i to ly do if gcd ( InY [ i ] , m) = InY[i] 

then flag := 1; tt := i end if end do end do; 
L := [op (L) , GY [tt] *m/leadterm (GY [tt] , tdeg (v) ) ] 
end do; 

F := 0; i := 0; 

for 1 in L while true do i := i+1; F := F+a[i]*l end do; 
CC := coeff s (normalf (1-F, GXMinusY, tdeg (v) ), [v] ) ; 
VV := solve (\ {CC\} ) ; 
expand (subs (VV, F+l) ) 
end proc; 



(b) Projective case. In input the procedure requires: 

GY = Grobner basis of I(Y), which is an homogeneous ideal, 
GXMinusY = Grobner basis of I(X \ Y), which is an homogeneous ideal, 
EstX = standard basis of degree t of R/I(X) 
EstY = standard basis of degree t of R/I(Y), 
var = list of indeterminates, 
here t is the minimal degree for which (R/ Ix)t as a vector space has the same dimension as the number 
of points in X. 

The Output is the numerator of the ratio of polynomials giving the separator function of Y in X. Its 
denominator is the t-power of the sum of the variables. 



G_to_F_homo : =proc (GY, GXMinusY, EstX, EstY, var, t) 
local E, InY, ly,L, flag, tt, F, i, m, l,v, CC, VV, S : 
with (Groebner ) : 

v : =op (var) : 

E:=EstX minus EstY: 

InY:=[seq(leadterm(GY[i] , tdeg (v) ) , i=l . .nops (GY) ) ] : 
ly : =nops ( InY) : 
L: = [] : 

for m in E do 
flag:=0 : 

while (flag = 0) do 

for i from 1 to ly do if (gcd ( InY [ i ] , m) =InY [ i ] ) then flag:=l: tt:=i: f i : end: 
end : 

L:= [op (L) , GY [tt] *m/ leadterm (GY [tt] , tdeg (v) ) ] : 
end : 

F:=0: i:=0: 

for 1 in L do i:=i+l: F:=F+a[i]*l: end: 
S : =sum ('v[k]',k=l.. nops (var)) : 

CC:=coeffs (normalf (S"t-F, GXMinusY, tdeg (v) ) , [v] ) : 
VV:=solve (\ {CC\ } ) : 
expand (subs (VV, S"t-F) ) : 
end proc; 
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